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HIGH-FREQUENCY DATA? 
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Princeton University and UPMC (Universite Paris-6) 

This paper considers the problem of testing for the presence of a 
continuous part in a semimartingale sampled at high frequency. We 
provide two tests, one where the null hypothesis is that a continuous 
component is present, the other where the continuous component 
is absent, and the model is then driven by a pure jump process. 
When applied to high-frequency individual stock data, both tests 
point toward the need to include a continuous component in the 
model. 

1. Introduction. This paper continues our development of statistical meth- 
ods designed to assess the specification of continuous-time models sampled 
at high frequency. The basic framework, inherited from theoretical models 
in mathematical finance but also common in other fields such as physics or 
biology, is one where the variable of interest X, in financial examples often 
the log of an asset price, is assumed to follow an Ito semimartingale. That 
semimartingale is observed on some fixed time interval [0,T] at discrete 
regularly spaced times iA n , with a time lag A n which is small. 

A semimartingale can be decomposed into the sum of a drift, a continuous 
Brownian-driven part and a discontinuous, or jump, part. The jump part 
can in turn be decomposed into a sum of "small jumps" and "big jumps." 
Such a process will always generate a finite number of big jumps, but it may 
give rise to either a finite or infinite number of small jumps, corresponding to 
the finite and infinite jump activity situations, respectively. In earlier work, 
we developed tests to determine on the basis of the observed sampled path 
on [0,T] whether a jump part was present, whether the jumps had finite 
or infinite activity, and in the latter situation proposed a definition and an 
estimator of a "degree of jump activity" parameter. 
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In this paper, we tackle the last remaining question: Does the semimartin- 
gale need to have a continuous part? In other words, is the Brownian motion 
present at all? From a model specification standpoint, there is a natural sta- 
tistical interest in distinguishing the two situations where a continuous part 
is included or not, on the basis of an observed sample path. When there 
are no jumps, or finitely many jumps, and no Brownian motion, X reduces 
to a pure drift plus occasional jumps, and such a model is fairly unrealistic 
in the context of most financial data series, although it may be realistic in 
some other contexts. But for financial applications one can certainly con- 
sider models that consist only of a jump component, plus perhaps a drift, if 
that jump component is allowed to be infinitely active. 

Many models in mathematical finance do not include jumps. But among 
those that do, the framework most often adopted consists of a jump-diffusion: 
these models include a drift term, a Brownian-driven continuous part and 
a finite activity jump part (see, e.g., [6, 7] and [16]). When infinitely many 
jumps are included, however, there are a number of models in the literature 
which dispense with the Brownian motion altogether. The log-price process 
is then a purely discontinuous Levy process with infinite activity jumps or, 
more generally, is driven by such a process (see, e.g., [9, 10] and [14]). 

The mathematical treatment of models relying on pure jump processes is 
quite different from the treatment of models where a Brownian motion is 
present. For instance, risk management procedures, derivative pricing and 
portfolio optimization are all significantly altered, so there is interest from 
the mathematical finance side in finding out which model is more likely to 
have generated the data. 

For all these reasons, it is of importance to construct procedures which 
allow us to decide whether the Brownian motion is really here, or if it can be 
forgone in favor of a pure jump process. This is the aim of this paper: we will 
provide two tests allowing for a symmetric treatment of the two situations 
where the null hypothesis is that the Brownian motion is present, and where 
the null is that the Brownian motion is absent. 

In the context of a specific parametric model, allowing for jump compo- 
nents of finite or infinite activity on top of a Brownian component, [8] find 
that the time series of index returns are likely to be devoid of a continuous 
component. An alternative but related approach to testing for the presence 
of a Brownian motion component to the one we propose here is due to [17]. 
They employ the test statistic for jumps of [5], plot its logarithm for dif- 
ferent values of the power argument and contrast the behavior of the plot 
above two and below two in order to identify the presence of a Brownian 
component. A formal test is constructed under the null hypothesis where a 
continuous component is present. 

The methodology that both [17] and we employ to design our respective 
test statistics is based on tried-and-true principles that originate in our 
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earlier work on testing whether jumps are present [5], whether they have 
finite or infinite activity [3] and on estimating the index of jump activity 
[4], although, of course, exploited in a manner specific to the problem at 
hand. We compute power variations of the increments, suitably truncated 
and/or sampled at different frequencies. Exploiting the different asymptotic 
behavior of the variations as we vary these parameters gives us enough 
flexibility to accomplish our objectives. As is well known, powers below two 
will emphasize the continuous component of the underlying sampled process. 
Powers above two will conversely accentuate its jump component. The power 
two puts them on an equal footing. Truncating the large increments at a 
suitably selected cutoff level can eliminate the big jumps when needed, as was 
shown by [15]. Finally, sampling at different frequencies can let us distinguish 
between situations where the variations converge to a finite limit, in which 
case the ratio of two variation measures constructed at different frequencies 
will converge to one, from situations where the variations converge to either 
zero or diverge to infinity, in which case the ratio will typically converge 
to a different constant. Since these various limiting behaviors are indicative 
of which component of the model dominates at a particular power, they 
effectively allow us to distinguish between all manners of null and alternative 
hypotheses. 

This said, the commonality of approach should not mask the fact that each 
situation is, in reality, mathematically quite different. By nature, certain 
components of the model are turned off under particular null hypotheses. For 
instance, when the null hypothesis is that no Brownian motion is present, as 
will be the case for our first test here, then jumps drive the asymptotics. As a 
result, the driving component of the model that matters for the asymptotic 
behavior of the statistic will vary with the situation and consequently the 
methods employed behind the scenes to obtain the desired asymptotics will 
vary accordingly. 

The paper is organized as follows. Section 2 describes our model and the 
statistical problem. Our testing procedure is described in Section 3, and 
the next two Sections, 4 and 5, are devoted to a simulation study of the 
tests and an empirical implementation of our tests on high-frequency stock 
returns. Section 6 is devoted to technical results and to the proof of the main 
theorems. 

2. The model. The underlying process X which we observe at discrete 
times is a 1-dimensional ltd semimartingale defined on some filtered space 
(fi, J 7 , (J r t)t>o,P), which means that its characteristics (B,C,is) are abso- 
lutely continuous with respect to Lebesgue measure. B is the drift, C is the 
quadratic variation of the continuous martingale part and v is the compen- 
sator of the jump measure fi of X. In other words, we have 
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(1) 



u{uj,dt,dx) = dtF t {uj,dx). 



Here b and a are optional process, and F = Ft(u,dx) is a transition mea- 
sure from Q x M + endowed with the predictable c-field into M. \ {0}. More 
customarily, one may write X as 



where W is a standard Brownian motion. It is also possible to write the 
last two terms above as integrals with respect to a Poisson measure and its 
compensator, but we do not need this here. This is a standard setup and we 
refer the reader to [13] for details. 

We have referred above to "small jumps" and "big jumps." In the context 
of (2), they are represented, respectively, by the last two integrals. The size 
cutoff 1 adopted here is arbitrary and could be replaced by any fixed e > 0, 
a change which amounts merely to an adjustment to the drift term Bt- Note 
that the small jumps integral needs to be compensated by u since there are 
potentially an infinite number of such small jumps. The large jump integral 
is always a finite sum; it may be compensated if desired but this is not 
necessary. Any compensation or lack thereof is then again absorbed by an 
adjustment to the drift. 

We now turn to the assumptions. As usual for tests, the assumptions 
essentially ensure that one can compute and then estimate a significance 
level under the null hypothesis. So here, we need some structure for the 
jumps of X, namely that the small jumps essentially behave like the small 
jumps of a stable process with some index /3, up to a random intensity. As 
noted above, when no Brownian is present, we view the realistic situation as 
one where there are infinitely many small jumps. When the null is that there 
is a Brownian motion, we need the additional assumption that the volatility 
process at is itself an ltd semimartingale. 

We would like to give tests with a prescribed asymptotic level, as n — > oo, 
and, of course, this is more difficult when (3 increases because then the 
process resembles more and more a continuous process plus a few big jumps: 
The qualitative behavior of the paths can become quite similar whether the 
Brownian motion is present or not. So, unsurprisingly, we can exhibit a test 
with prescribed level, for the null hypothesis where the Brownian motion is 
present, only when j3 < 1. The parameter f3 is typically unknown (although 
a method for estimating j3 in this setting is given in [4]). On the other hand, 
for the null hypothesis where the Brownian motion is absent we provide a 
test which works under no assumption on /?. 



(2) 
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With this context in mind, here is the first assumption which will be 
assumed throughout: 

Assumption 1. (i) The drift process bt is locally bounded and the volatil- 
ity process at is cadlag. 

(ii) There are three constants < /3" </?'</?< 2 and a locally bounded 
process Lt>l, such that the Levy measure Ft is of the form Ft = F[ + F" , 
where 

( 3 ) F t( dx ) = |^jT+0 K 1 { 0< X < Z ^} +a t h-^KxKO}) 6 *' 

where af , CLj~ , , z^ are nonnegative predictable processes and f = f(uj,t,x) 
is predictable function (meaning V <g> B(M) -measurable, where V is the pre- 
dictable a-field on [0, oo) x VI), satisfying 

t <4 +) <i, i- <4~ ] <h 

Lif Lit 

A t := a| +) + a\ ] <L U \f(t,x)\<L t , 
and where F" is a measure which is singular with respect to F[ and satisfies 

(5) J{\xf M)Fl'{dx)<L t . 

This assumption is identical to Assumptions 1 and 2 of [4] [with some no- 
tational changes: (7, j3' ,af ,a^~) in that paper are called here (/3 — f3' , /3" , (3af , 
fia^ ) , and the condition f3" < f3' is not a restriction and is put here only for 
convenience] . 

For example, take a process solution of the stochastic differential equation 

(6) dX t = b t dt + a t dW t + 8 t - dY t + 5' t _ dY[ , 

where 5 and 5' are cadlag adapted processes, Y is /3-stable or tempered 
/3-stable and Y' is any other Levy process whose Levy measure integrates 
\x\p near the origin and has an absolutely continuous part whose density 
is smaller than ET|x| _ ( 1+ ^ ) on [—1, 1] for some K > (e.g., a stable process 
with index strictly smaller than (3'). Then X will satisfy Assumption 1. 

If this assumption is satisfied with j3 < 1, then almost surely the jumps 
have finite variation ^2 s<t |AX S | < 00 for all t or equivalently, J \x\/j,(ds, dx) < 
00. This allows us to decompose X into the sum X = X' + X" , where 

(7) X[ = X + I b' s ds + f a s dW s , X' t ' = J2 & X s , 

Jo Jo s < t 

and where b' t = bt — f xln x \<nFt(dx) is a locally bounded process. 
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For clarity, we will derive the properties of both tests under the same 
generic Assumption 1 even though the properties of the test for the null of 
a Brownian present remain valid under weaker assumptions. When the null 
hypothesis to be tested is that the Brownian motion is present, it becomes 
the driving process for our test statistic and as is customary for tests or esti- 
mation problems involving a stochastic volatility, we then need an additional 
regularity assumption on the a process: 

Assumption 2. We have Assumption 1 with ft < 1. Moreover the volatil- 
ity process o~t is an ltd semimartingale, that is, it can be written (necessarily 
in a unique way) as 

(8) a t = a + [ b s ds + [ a s dW s + N t + V Acr s l{|A CTs |>i}, 

Jo Jo s < t 

where N is a local martingale which is orthogonal to the Brownian motion 
W, and further the compensator of the process [N, N] t + Yls<t 1{|a<t s |>i} *s 
of the form J^n s ds. Moreover we suppose that: 

(i) the processes bt and nt are locally bounded; 

(ii) the processes at and b' t defined above are cadlag. 

3. The two tests. 

3.1. The hypotheses to be tested. In a semimartingale model like (2), say- 
ing that the Brownian motion W is absent on the interval [0,T] does not 
mean that there is no Brownian motion on the probability space (something 
which cannot be tested at all, obviously) but it means that the Brownian 
motion does not impact the observed process X, in the sense that the corre- 
sponding stochastic integral vanishes on this interval, or equivalently a s = 
for Lebesgue-almost all s in [0, T], and it would be more appropriate to say 
that we are testing whether "the continuous martingale part of X vanishes 
on [0, T], or not." This is typically an w-wise property: we can divide the set 
Q into two complementary subsets 

(9) n ^ = \J a ^ s>0 }' n ™ w = y o cr 2 s ds = o\. 

Then almost surely on the set W the integral process X£ = J * a s dW s 
vanishes on [0, T], whereas it does not vanish on the complement tVp . In 
what follows, we take to represent the hypothesis that the Brownian 
motion is present and Q^ oW to represent the hypothesis that the Brownian 
motion is not present. 
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In connection with Assumption 1 we consider the following set represent- 
ing paths that have infinite jump activity of some index (3 £ (0,2): 

(10) n T 3 = {A T >0} where A t = [ A s ds. 

Jo 

One knows that on the set Otp the path of A over [0,T] has almost surely 
infinitely many jumps. 

We are interested in testing the following two situations: 



Ho-.n™ vs. H x \Sl T oW , 
Hq:£I t oW vs. Hx-.D™. 



(11) 

As discussed above, the realistic situation supposes that infinite activity 
jumps are present when under Q, T oW and so we will in fact provide testing 
procedures for the following two situations: 



(12) 



Ho-.nf vs. ^-.n^ nni^, 

H -.n T oW nnif 3 vs. ffi:fifnnj?. 



a = sup(limsupP(C„ | A) : A G T, A C n ,¥(A) > J , 
P = inf (lim inf P(C n | A) : A £ J", A C fii , F(A) > o) . 



In the second test, requiring 0^ with under H± allows us to characterize 
precisely the properties of the statistic under this alternative (as opposed to 
just flip). But it is not necessary for the actual implementation of the test 
which relies on its behavior under the null. 

Finally, we recall that testing a null hypothesis "we are in a subset Oo" 
of fi, against the alternative "we are in a subset fix," with, of course, f^o H 
ill = 0, amounts to finding a critical (rejection) region C n C O at stage n. 
The asymptotic size and asymptotic power for this sequence (C n ) of critical 
regions are the following numbers: 



(13) 



3.2. The building blocks. Before stating the results, we introduce some 
notation to be used throughout. We observe the increments of X 

(14) A?X = X iAn -X {i _ 1)An , 

to be distinguished from the (unobservable) jumps of the process, AX S = 
X s — X s _ . In a typical application, A is a log-asset price, so A" A is the 
recorded log-return over A n units of time. 

For any given cutoff level a>0we count the number of increments of A 
with size bigger than u, that is, 

[t/A„] 

(15) U{u,A n ) t = l{\Afx\>u}- 

i=i 
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If p > we also sum the pth absolute power of the increments of X , truncated 
at level u, that is, 

[t/A n ] 

(16) B(p,u,A n ) t = £ \A?X\n mx] < u} . 

i=i 

B is what we call a "truncated power variation." Note that in B we are 
retaining all increments smaller than u, whereas in U we are retaining those 
larger than u. 

We take a sequence u n of positive numbers, which will serve as our thresh- 
olds or cutoffs for truncating the increments, and will go to as the sampling 
frequency increase. There will be restrictions on the rate of convergence of 
this sequence, expressed in the form 

(17) u n /A p n 0, u n /A p n + -> oo for some < p_ < p+ < \. 

This condition becomes weaker when p + increases and when p_ decreases. 

In practice, when a Brownian motion is present, we will often translate 
values of the cutoff level u n in terms of a number of standard deviations of 
the continuous part of the semimartingale. That is, we express values of u n 
in terms of ct n where u n = ot n (t" 1 / * a 2 s ds) l l 2 All 2 . Despite the presence of 
jumps, the integrated volatility in that expression can be estimated using 
the small increments of the process, since 

[*/An] t 

(18) \^ X \ 2l mX\<a^}^ / O^ds 
i=l Jo 

for any a > and w G (0,1/2). We can then vary the cutoff level a n to 
yield a number of (estimated) standard deviations of the continuous part of 
the semimartingale. This data-driven choice can help determine a range of 
reasonable values for the cutoff level and provide on a path-by-path basis 
an equivalent, but perhaps more intuitive, scale with which to measure the 
magnitude of the cutoff level u n . 

When there is no Brownian motion under the null, a different scale needs 
to be used to assess the size of u n . For example, we can translate u n into the 
percentage of the sample that is greater than the cutoff level, and therefore 
not included in the computation of the truncated power variations. 

3.3. Testing for the presence of Brownian motion under the null. In a 
first case, we set the null hypothesis to be "the Brownian motion is present," 
that is Qj? , against the alternative QJ]? W n fij? . 

In order to construct a test, we seek a statistic with markedly different 
behavior under the null and alternative. One fairly natural idea is to consider 
powers less than 2 since in the presence of Brownian motion they would be 
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dominated by it, while in its absence they would behave quite differently. 
Specifically, the large number of small increments generated by a continuous 
component would cause a power variation of order less than 2 to diverge 
to infinity. Without the Brownian motion, however, and when p > (3, the 
power variation converges to at exactly the same rate for the two sampling 
frequencies A n and kA n , whereas in the former case the choice of sampling 
frequency will influence the magnitude of the divergence. Taking a ratio will 
eliminate all unnecessary aspects of the problem and focus on the key aspect, 
that of distinguishing between the presence and absence of the Brownian 
motion. 

Specifically, we fix a power p S (0, 2) and an integer k>2, and we consider 
the test statistics, which depend on p and on the terminal time T and on 
the sequence u n subject to (17), as follows: 

(19) S n - B ^ u - A ^ 



B(p,u n ,kA n ) T ' 

As will become clear below, taking ratios of power variations has the ad- 
vantage of making the test statistic model-free. That is, its distribution under 
the null hypothesis can be assessed without the need for the extraneous esti- 
mation of the dynamics of the process in (2). Obviously, these dynamics can 
be quite complex with potentially jumps of various activity levels, stochas- 
tic volatility, jumps in volatility, etc. So the fact that the standardized test 
statistic can be computed without the need to estimate the various parts of 
(2) is a desirable feature. In fact, implementing the test — that is, comput- 
ing the statistic in (19) and estimating its asymptotic variance — will require 
nothing more than the computation of various truncated power variations. 

The first result is a law of large numbers (LLN) giving the probability 
limit of the statistic S n . 



Theorem 1. Under Assumption 1 and ifp£ (1,2), we have 
k 1 ^ 2 , on the set , 

1, ontheset£l™ w C\StJ r 3 ,ifp>l3\jl,p + < 

(20) 

This result shows that, since k l ~ p l 2 > 1, for the test at hand an a pri- 
ori reasonable critical region is C n = {S n < c n }, for a sequence c n increasing 
strictly to k 1 ~ p ^ 2 : in this case the asymptotic power is 1 in restriction to the 
set described in the second alternative above, whereas the asymptotic level 
depends on how fast c n converges to k x ~ p l 2 . 

For a more refined version of this test, with a prescribed level a S (0, 1), 
we need a central limit theorem (CLT) associated with the convergence in 
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(20). For this we need some notation: letting Z and Z 1 be two independent 
jV(0, 1) variables, we set 



(21) 



(m p = E(\Z\P), 
m k , P = E( | Z \p | Z + y/k^lZ'\ p ) , 

N(p, k) = — (k 2 ~ p (l + k)m 2p + k 2 ~ p (k - l)ml - 2k 3 ~ 3 P/ 2 m k , p ). 
m 2p 



In terms of known functions, we have 



(22) 



TO, 



2 P/2 + 1 



r 



m fe , p = ^(fe-ir/ 2 r( 



1 + r 



^2,1 



p p + 1 1 

2' 2 



where i^i is Gauss's hypergeometric function (see, e.g., Section 15.1 of [1]). 

£— (s) 

Then the standardized version of the CLT goes as follows (we use — > 
to denote the stable convergence in law (see, e.g., [13] for this notion); to 
explain the following statement, we recall that the convergence in law "in 
restriction to a subset Oo" is meaningless, but the stable convergence in law 
in restriction to Qq makes sense): 



Theorem 2. Suppose that Assumption 2 holds, take p S (1,2) and let 
the sequence u n satisfy (17) with p_ > ■ Then we have the following 

convergence in law: 



(23) (S n - k 1 - p/2 )/^/V^ C -^ > A/"(0, 1) in restriction to 
where 

B(2p,u n ,A n ) T 



w 

T ; 



(24) 



V n = N(p,k) 



{B(p,u n ,A n ) T f 



We are now ready to exhibit a critical region for testing Hq : tij? vs. 
Hi : il^° w n using S n with a prescribed asymptotic level a 6 (0, 1). De- 
noting by z a the a-quantile of iV(0, 1), that is, F(Z > z a ) = a where Z is 
iV(0,l), we set 



(25) 



C n — {S n < k P^ Z a y/V n }. 



Theorem 3. Suppose that Assumption 2 holds. Let p£ (1,2) and let the 
sequence u n satisfy (17) with 



(26) 



p — 1 
2p-2(3 



P- <Ph 



P-1 



{hence p > 2/3). 
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Theoretical p-Limits of S n 



H : Brownian 
+ Cauchy 



H : Brownian 
+ Cauchy 




Fig. 1. Probability limits as a function of p of the test statistic S n with k — 2 in the case 
of a Cauchy process (j3 = 1, Ho) and a Brownian plus Cauchy processes (Hi). 

Then the asymptotic level of the critical region defined by (25) for testing the 
null hypothesis "the Brownian motion is present" (i.e., Qj( against ^l^ oW 
&t ) egua/s a, and the asymptotic power equals 1. 



n 



To perform the test we need to choose p and the sequence u n . In practice 
one does not know /3, although it should be smaller than 1 by Assumption 
2. Hence if we are willing to assume that /3, although unknown, is not bigger 
than some prescribed f3o < 1, one should choose p G (2/3o,2), and one may 
take u n = aA" for some a > and some w G (0, 1/2), and the test can be 
done as soon as 

(27) 



2p - 2/3 c 



< w < 



p — 1 



P 



To properly separate the two hypotheses it is probably wise to choose p 
closer to 2/3q than to 2. 

Remark 1. The first part of the consistency result (20) holds also for 
p G (0, 1] on tij? (with basically the same proof). The second part also holds 
for (3 < p < 1 on the set on which Xt = Xq + ^2 s<t AJ S for all t<T, that is, 
when there is no drift, whereas when there is a drift, S n converges to k l ~ p for 
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all p G (0, 1]. When < p < (3 the limit of S n is k x ~ v ^ on Q™ w n ft^when 
p > 1, and also when p < 1 when again there is no drift (and the proof is 
more involved). Figure 1 illustrates these various limits in the case X is 
the sum of a Brownian martingale plus possibly a Cauchy process (with no 
drift). 

Remark 2. The CLT necessitates p£ (1>2). However, more sophisti- 
cated techniques would allow us to prove the same result for all p £ (0,2), 
under the additional assumption that at does not vanish for t £ [0,T], on 
the set (we still need j3 < 1, however). 

Remark 3. Despite the fact that using powers less than 2 is the most 
natural way to isolate the contribution of the Brownian motion to the overall 
increments of the process, it is possible to design an alternative test that 
relies on powers greater than 2. Instead of the statistic S n above, we could 
use the following statistic: pick 7 > 1 and p' > p > 2, and set 

, 28 ^ £ = B(p', -yu n ,A n ) T B(p, u n , A n ) T B(2, -yu n ,A n ) T 

B(p',u n ,A n ) T B(p,>yu n ,A n ) T B(2,u n ,A n ) T ' 

Under Assumption 1, S n converges in probability to 7 P ~ p on the set Vt^ n 
n^, and to 1 p'-p+ 2 -' 3 on the set QJ}° W n O^, as soon as p+ < We 

also have a CLT under Assumption 2 and if p + < jj^Eio • Under 5*n is 
model-free, just like is. So one can, in an obvious way, construct a test 
based on S n and which satisfies the claims of Theorem 3, under suitable 
conditions on the cutoff levels u n . However, simulations studies suggest that 
the statistic S n is not as well behaved as S n , and so we do not pursue its 
study further. 

3.4. Testing for the absence of Brownian motion under the null. In a 
second case, we set the null hypothesis to be "the Brownian motion is ab- 
sent," that is, n™ w C\^. Desi gning a test under this null is trickier because 
the model becomes a pure jump (plus perhaps a drift) process, and we are 
aiming for a test that remains model-free even for this model. That is, we 
are looking for a statistic whose limiting behavior under the null, despite 
being driven by what is now a pure jump process, does not depend on the 
characteristics of the pure jump process, such as its degree of activity /3, 
since those characteristics are a priori unknown. 

This can be achieved as follows. We choose a real 7 > 1 and a sequence 
u n satisfying (17) and define the test statistic 

1 ' n B(2, Uri ,A n ) T U( 1Un ,A n ) T ' 
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To understand the construction of this test statistic, recall that in a power 
variation of order 2 the contributions from the Brownian and jump com- 
ponents are of the same order. But once the power variation is properly 
truncated, the Brownian motion will dominate it if it is present. And the 
truncation can be chosen to be sufficiently loose that it retains essentially all 
the increments of the Brownian motion at cutoff level u n and a fortiori ~fu n , 
thereby making the ratio of the two truncated quadratic variations converge 
to 1 under the alternative hypothesis. On the other hand, if the Brownian 
motion is not present, then the nature of the tail of jump distributions is 
such that the difference in cutoff levels between u n and ^yu n remains mate- 
rial no matter how far we go in the tail, and the limit of that same ratio 
will reflect it: it will now be j 2 ~P under assumptions made specific in the 
formal theorems below. Since absence of a Brownian motion is now the null 
hypothesis, the issue is then that this limit depends on the unknown /3. 

Canceling out that dependence is the role devoted to the ratio of the 
number of large increments, the C/'s, in (29). The JJ's are always dominated 
by the jump components of the model whether the Brownian motion is 
present or not. Their inclusion in the statistic is merely to ensure that the 
statistic is model-free, by effectively canceling out the dependence on the 
jump characteristics that emerges from the ratio of the truncated quadratic 
variations. Indeed, the limit of the ratio of the IPs is 7^ under both the null 
and alternative hypotheses. As a result, the probability limit of S' n will be 
7 2 under the null, independent of ft. 

Our first result states this precisely, establishing the limiting behavior of 
the statistic in terms of convergence in probability: 

Theorem 4. Let the sequence u n satisfy (17), and suppose that As- 
sumption 1 holds. Then 

f v s , JP^ ( 7 2 , on the set Sl™ w n Stij! , 

n ^\^, on the set nfnCt^. 

For a test with a prescribed level we need a standardized CLT. 

Theorem 5. Suppose that Assumption 1 holds with f3" < and /3' < 
^, and (17) holds with p + < A A A Then we have 

(31) (5; - 7 2 )/ v / ^[ £ ^ ) A/"(0, 1) in restriction to Q^ oW n Q^, 
where is given by the following formula: 
l= J B{A,u n ,A n ) T 1 
n 7 \{B{2,u n ,An)T) 2 + U(u n ,A n ) 



(32) 



7 V V (£(2, iu n , A n ,) T ) 2 U( 7 u n , A n ) 
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Hence a critical region for testing Hq : ^lJjf w n vs. Hi : fl 0^ is 
(33) C' n = {S l n < 1 2 -z a ^VI l }. 

Theorem 6. Suppose that Assumption 1 holds with ft" < and /?' < 
^, and (17) holds with p + < A ^ A T/ien i/ie asymptotic level of 
the critical region C' n defined by (33) for testing the null hypothesis "the 
Brownian motion is absent" (i.e., Q,^ oW nfl^ against Qj( n Ql^ ) equals a, 
and the asymptotic power equals 1. 

If we take again u n = a A™ , the test can be performed if a > and 

12 2-/3 

(always smaller than 1/2). This requirement is constraining, because f3 is 
unknown, and may typically be close to 2 if we believe in the null hypothesis. 
Therefore in practice we must assume that f3 does not exceed a given /3q £ 
[1,2). This means that this limiting index (3q is given a priori, and we do 
the test under the Assumption 1 with 2/3' < j3 < /3q and /3" < , with w 
subject to the (feasible) condition 

(35) 0<w< ™. 

3po 

These facts are not really surprising: first, by (30) we know that the statistic 
S' n properly separates the two hypotheses only when /3 is not too close to 
2. And, second, when f3 becomes very close to 2, the paths of X have big 
jumps but also the compensated sum of small jumps looks more and more 
like a Brownian path, even on the set Q^P W . 

Remark 4. It is possible to design an alternative statistic with similar 
properties but make no use of the C/'s. Instead of the statistic S' n in (29), 
we could use the following statistic: pick 7 > 1, k > 1 and p > 2, and set 

/ 36 ^ af = £ (2, Un, A n ) T B(p, K-yUn, A n ) T 

n ~ B(2,~fu n ,A n ) T B(p,Ku n ,A n ) T ' 

Under Assumption 1, S' n converges in probability to j p ~ 2 on the set Q™ w n 
Qj!, and to j p ~P on the set Qj( n , as soon as p+ < The ratio of pth 
power variations plays a similar role to that of the J7's, namely to cancel 
out the dependence of the p-lim of S' n on /3 under the null. The fixed scaling 
factor k allows us to use different cutoff levels for the two powers p and 2 
without affecting the probability limit of the statistic. We also have a CLT if 
P+ < ^g-- Under Hq, S' n is model- free, just like S' n is, and so a test follows. 
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But as was the case for the statistic S n proposed in (28), simulations studies 
suggest that S' n is not as well behaved as S n . 

Remark 5. In Theorems 2 and 3 the rate of convergence is hidden 
because of the standardization, but it is l/y/A n , clearly optimal since there 
are 1 + [T/A n ] observation altogether. In Theorems 5 and 6 the rate is 

8 /2 

1/un , which is again "optimal" when we only use the increments bigger 
than u n [more precisely, if we were able to observe exactly all jumps of 
X with size bigger than u n , this rate would be the optimal one, up to a 
log(l/it n ) term]. However, for those theorems we also have to choose u n : the 
smallest u n is, compared to A n , the biggest the actual rate is, but we are 
limited in this choice by the upper bound on p + . For example if we take 
u n = a£%, and due to (35), the best rate is "almost" l/A^ 2 "^ 76 *. 

4. Simulation results. We now report simulation results documenting 
the finite sample performance of the test statistics S n and S' n . We calibrate 
the values to be realistic for a liquid stock trading on the NYSE, and we 
consider an observation length of T = 21 days (one month) sampled every 
five seconds. 

We conduct simulations to determine the small sample behavior of the two 
statistics S n and S' n under their respective null and alternative hypotheses. 
The tables and graphs that follow report the results of 5000 simulations. The 
data generating process is the stochastic volatility model dX t = a t dWt + 
9 dY t , with a t = v\ /2 , dv t = £( v -v t )dt + (pv] /2 dB t + dJ t , E[dW t dB t ] = pdt, 
rj 1 / 2 = 0.25, (f> = 0.5, £ = 5, p = —0.5, J is a compound Poisson jump pro- 
cess with jumps that are uniformly distributed on [—30%, 30%] and Xq = 1. 
The jump process Y is a /3-stable process with (3 = 1, that is, a Cauchy 
process (which has infinite activity, and will be our model under fij?; this 
is a borderline case for the statistics S n under the null, nevertheless we 
will see that this statistic behaves well). Given rj, the scale parameter 9 (or 
equivalently A) of the stable process in simulations is calibrated to deliver 
different various values of the tail probability P = ¥(\AY t \ > \tfl 2 A n 12 ). In 
the various simulations' design, we hold r] fixed. Therefore the tail proba- 
bility parameter P controls the relative scale of the jump component of the 
semimartingale relative to its continuous counterpart. We set 6 such that 
neither of the two components of the model, atdWt an d 9Yt, is negligible 
compared to the other when the hypothesis states that they should both be 
present. We achieve this by computing the expected percentage of the total 
quadratic variation attributable to jumps on a given path from the model, 
and set it to values that range from 5% and 95%. 
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4.1. The first test. The statistic S n is implemented with k = 2 and val- 
ues of p that range from to 2 (recall Remark 2). Figure 2 compares the 
theoretical and Monte Carlo behavior of S n as a function of the power p 
under the null hypothesis where a Brownian motion is present, in addition 
to a Cauchy pure jump process. Figure 3 shows the corresponding results 
under the alternative hypothesis, where there is no Brownian motion. The 
theoretical curves are computed from the expected values of the truncated 
power variations using the exact density of the increments at the sampling 
interval A n = 5 seconds, rather than their asymptotic limits for A n — > 0. 
This introduces a slight Jensen's inequality effect in the figure but appears 
to capture well the small sample behavior of the statistic. 

Recall that for concreteness a is expressed as a number of standard de- 
viations of the Brownian part of X: that is, the level of truncation u n is 
expressed in terms of the number a of standard deviations of the contin- 
uous martingale part of the process, defined in multiples of the long-term 
volatility parameter J] 1 / 2 : a is defined by u n = ar] 1 / 2 A}/ 2 . Our view of the 
joint choice of (w, a) is that they are not independent parameters in finite 
sample: they are different parameters for asymptotic purposes but in finite 
samples the only relevant quantity is the actual resulting cutoff size u n . This 
is why we are reporting the values of the cutoffs u n in the form of the a 
that would correspond to w = 1/2. This has the advantage of providing an 
easily interpretable size of the cutoff compared to the size of the increments 
that would be expected from the Brownian component of the process: we 
can then think in terms of truncating at a level that corresponds to a = 4, 6, 
etc., standard deviations of the continuous part of the model. Since the ulti- 
mate purpose of the truncation is either to eliminate or conserve that part, 



Behavior of S n Under the Null Hypothesis 
H Q : Brownian + Cauchy 



Theoretical Behavior of S„ 



Monte-Carlo Behavior of S n 



o . o ^ 
o . o 



• — p o.o 1 — - 

2.0 0.0 



Fig. 2. Theoretical and Monte Carlo behavior of S„ as a function of the power p un- 
der the null hypothesis where a Brownian motion is present, in addition to a pure jump 
(Cauchy) process. 
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Behavior of S n Under the Alternative Hypothesis 
Hi: Cauchy Alone, No Brownian 



Theoretical Behavior of S„ 



Monte-Carlo Behavior of S„ 




p o.o 



Fig. 3. Theoretical and Monte Carlo behavior of S„ as a function of the power p under 
the alternative hypothesis where a Brownian is absent. 



it provides an immediate and intuitively clear reference point. Of course, 
given u n and this a, it is possible to back this into the value of the a cor- 
responding to any w, for that given sample size, including the value(s) of 
w that satisfy the required inequalities imposed by the asymptotic results. 
This approach would lose its effectiveness if we were primarily interested 
in testing the validity of the asymptotic approximation as the sample size 
varies, but for applications, by definition on a finite sample, it seems to us 
that the interpretative advantage outweighs this disadvantage. 

The statistic in the plots is computed with a truncation level correspond- 
ing to a = 7. Table 1 looks at the dependence of the results on the choice of 
a. 

Next, we report in Figure 4 histograms of the values of the unstandardized 
S n computed under Ho : Qjf and H\ : Q T ° w n , respectively, and with the 
same level of truncation a = 7. The vertical lines represent the anticipated 



Table 1 

Testing Ho : vs. Hi : Q™ w n Q, 1 ^ : Monte Carlo rejection rate for the test for the 
presence of a Brownian motion using the statistic S n 



Degree of Test Sample rejection rate (%) for power p 

truncation theoretical 0.25 0.5 0.75 1.0 1.25 1.5 1.75 

~ct level 

6 10% 9.1 9.4 9.4 9.2 9.1 9.3 8.9 

5% 4.6 4.7 4.8 4.7 4.5 4.3 4.1 

7 10% 9.7 9.7 9.8 9.8 9.9 10.2 9.8 

5% 5.0 5.0 5.1 5.0 4.9 4.7 4.4 

8 10% 9.7 9.9 9.9 10.0 9.9 10.1 9.9 

5% 5.0 5.1 5.1 5.1 4.9 4.8 4.5 
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Distribution of S n Under the Null and Alternative Hypotheses 



density 
lOf- 

p - 0.5 



Hi: 
Cauchy 
Alone 



Non- Standardized 
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p = 1.5 



H ; 
Brown i an 
+ Cauchy 



Cauchy 
Alone 



H : 
Brownian 
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Fig. 4. Nonstandardized distribution of S n under the null and alternative hypotheses for 
two values of p. 



limits of the statistic in the two situations, k l ~ p / 2 under Hq and either 
1 when p > (3 or h l ~ p IP when < p < (3 under Hi, based on Theorem 1 
and Remark 1. Since here f3 = 1, the two graphs with p = 0.5 and p = 1.5 
illustrate the two situations where p < /3 and p > /3. 

Figure 5 reports the Monte Carlo distribution of the statistic S n , stan- 
dardized according to Theorem 2, compared to the limiting J\f(0, 1) 
distribution. Table 1 reports the Monte Carlo rejection rates of the test of 
H : tlx vs. Hi : QJf w n ^ at the 10% and 5% level, using the test statistic 
S n , for various levels of truncation a. We find that the test behaves well, 
with empirical test levels close to their theoretical counterparts. 



Distribution of S n Under the Null Hypothesis 
Standardized 
density density 




-4-2024 -4-2024 



Fig. 5. Standardized distribution of S n under the null hypothesis of a Brownian motion 
present for two values of p. The histogram represents the small sample distribution while 
the solid curve is the asymptotic A/"(0, 1) density. 
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4.2. The second test. We now turn to the second problem, that of test- 
ing H : n^ oW n vs. H x : n ntf. For this test, S' n is implemented with 
a second truncation level twice as large as the first, that is, 7 = 2. The 
simulation evidence suggests that the results are largely similar for values 
of 7 within a range of 1.5 to 2.5. Parameter values are identical to those 
employed for the first test. Since there is no Brownian motion under the 
null, the truncation level u n is set in terms of the percentage of observations 
that are excluded by the truncation. For comparison with the truncation 
levels employed in the first test, we report it here again in terms of a, a 
number of standard deviations for the Brownian motion using the same 
parameter values as under the first test's null, or this test's alternative hy- 
pothesis. 

Under the null, the model is driven exclusively by the Cauchy process. 
Figure 6 shows the limiting value of S' n under Hq, as a function of the trun- 
cation level a, comparing the theoretical limit of 7 2 = 4 given in Theorem 
4 (left graph) and the corresponding average value of S' n from the Monte 
Carlo simulations (right graph). Figure 7 shows the corresponding values 
under the alternative hypothesis, where the increments of X are now gener- 
ated by a Brownian motion plus a Cauchy process. The theoretical limit on 
the left graph is computed from the expected values under the exact distri- 
bution of the increments at the sampling frequency A n rather than the p-lim 
7 2_/3 = 2 obtained in the limit where A n — > 0, with the same remark about 
Jensen's inequality applying here. We note that for small truncation levels 
(a = 4) the interaction of the Brownian and the stable processes is material, 
driving the actual limit above 2. If desired, small sample corrections for this 
interaction can be implemented along the same lines as in Section 5 of [4]. 



Behavior of S n ' Under the Null Hypothesis 
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Fig. 6. Theoretical and Monte Carlo behavior of S' n as a function of the truncation level 
a under the null hypothesis where a Brownian motion is absent. The model is a pure jump 
(Cauchy) process. 
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Behavior of S n ' Under the Alternative Hypothesis 
Hi : Brownian + Cauchy 



Theoretical Behavior of S n ' 



Monte-Carlo Behavior of S n ' 



Fig. 7. Theoretical and Monte Carlo behavior of S' n as a function of the truncation level 
a under the alternative hypothesis where a Brownian motion is present, in addition to a 
pure jump (Cauchy) process. 



The test statistic in simulations under the alternative appears to be 
slightly biased upwards. Quite naturally, this effect worsens as the pure jump 
process gets closer to a Brownian motion (for instance if (3 = 1.5 instead of 
0.5 or 1), and/or when the scale parameter 9 of the jump process increases 
since that makes isolating the effect of the Brownian motion component of 
the model relatively more difficult. 

Generally speaking, S' n is, under its alternative, more finicky than S n is 
under either its null or alternative. The reason for this is that S' n requires 
under H\ a Goldilocks- like conjunction of factors whereby the Brownian 
motion component of the model is sufficiently large to drive the behavior 
of the ratio of truncated quadratic variations, while the jump component of 
the model cannot be so small as to render inaccurate the ratio of the number 
of increments larger than the truncation level. 

Figure 8 reports the Monte Carlo distributions of S' n under Hq and H\\ 
the vertical lines represent the theoretical limits. Under H±, we note again 
that S' n is slightly biased upwards. Fortunately, this bias is limited to H\ so 
it does not adversely affect the implementation of the test per se, which is 
based on the behavior of S' n under Hq. But it can affect the interpretation of 
the results of the test implemented on real data, since, as we will see below, 
we will find empirical values of S' n below 4. Figure 9 reports the Monte 
Carlo and asymptotic distribution of the statistic S' n standardized under Hq 
as prescribed by Theorem 5. 

As said above, the histograms are computed using T = 21 days (one 
month) sampled every five seconds. With this length of the series, the empir- 
ical distribution of the statistic is very well approximated by its asymptotic 
A^(0, 1) limit. Shorter time periods (such as T = 1 day) tend to result in 
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Distribution of S n ' Under the Null and Alternative Hypotheses 
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Fig. 8. Nonstandardized distribution of S' n under the null and alternative hypotheses. 

right-skewness of the Monte Carlo distribution of S' n . We do not view the 
need for a longer series as a serious obstacle to the empirical implementa- 
tion of the test since one would not typically expect the Brownian motion 
component of the model to be turned on or off on a daily basis: one would 
expect the market to operate in such a way that the Brownian component is 
either there all the time or not there at all. But if an answer is nevertheless 
desired on a day-by-day basis, then the first test can always be implemented, 
as it requires substantially shorter time spans. 



Distribution of S n ' Under the Null Hypothesis 
Standardized 

density density 




Fig. 9. Standardized distribution of S' n under the null hypothesis of a Brownian motion 
absent for two values of the truncation level. The histogram, represents the small sample 
distribution while the solid curve is the asymptotic M (0, 1) density. 
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Table 2 

Testing H x :Sl™ w C\ 0^ vs. Ho : : Monte Carlo rejection rate 
for the test for the absence of a Brownian motion using the 
statistic S' n 



Sample rejection rate (%) 
for truncation level a 



Test theoretical level 


4 


5 


6 


7 


8 


10% 


9.0 


9.2 


9.1 


9.0 


9.2 


5% 


4.0 


4.2 


4.1 


4.2 


4.2 



Finally, the test's rejection rate under the null hypothesis is reported in 
Table 2. Since the test is one-sided (we reject Hq when the standardized S' n 
is too low), the right-skewness of the statistic visible in Figure 9 results in 
a slight under-rejection by the test. 

5. Empirical results. In this section, we apply the two test statistics 
to real data, consisting of all transactions recorded during the year 2006 on 
two of the most actively traded stocks, Intel (INTC) and Microsoft (MSFT). 
The data source is the TAQ database. Using the correction variables in the 
dataset, we retain only transactions that are labeled "good trades" by the 
exchanges: regular trades that were not corrected, changed, or signified as 
cancelled or in error; and original trades which were later corrected, in which 
case the trade record contains the corrected data for the trade. Beyond that, 
no further adjustment to the raw data are made. 

We first consider the test where the null hypothesis consists of a continu- 
ous component being present. Figures 10 and 11 show the values of the test 
statistic S n , plotted for a range of values of the power p, for the two data 
series. The empirical values of S n are labeled on the plots with numbers 
representing the sampling interval employed, in seconds, with values rang- 
ing from A n = 5 seconds to A n = 30 minutes. In addition to the empirical 
estimates, the figures display the two limits of S n under the null where a 
Brownian is present and the alternative hypothesis where it is absent. The 
theoretical limits correspond to those given in Figures 2 and 3, except that 
the theoretical limit under H\ (no Brownian present) is plotted for a value 
of P = 1.6, in line with the estimates of (5 given in [4] for these data series. 
Quite naturally, the closer (3 is to 2, the closer the jump component can 
mimic the behavior of a Brownian motion and the harder it becomes to tell 
the two hypotheses apart. The limit under Hq is independent of (3. Also on 
the figures are the two limits corresponding to the situation where market 
microstructure noise dominates. We include the two polar cases where the 
noise is either of a pure additive form or of a pure rounding form. 
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Statistic S„ : INTC 2006 Transactions, Various Frequencies 
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Fig. 10. Empirical estimates of S„ at various values of p and sampling frequencies from 
all Intel transactions during 2006. 
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Fig. 11. Empirical estimates of S„ at various values of p and sampling frequencies from 
all Microsoft transactions during 2006. 
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When the observations are blurred with either an additive white noise or 
with noise due to rounding, the respective limits are then 2 and y/2. Indeed, 
suppose that instead of observing the exact value of X we have on top of 
it an additive white noise, that is we observe Xi& n + Zj (at stage n, where 
the Zj's are i.i.d., and independent of the process X). If we suppose that 
Zj has a density which is continuous and nonvanishing at 0, then the noise 
is the leading factor in the behavior of B(p,u n , A n )x as soon as p/(2(p + 
1)) > P+ [recall (17)]. In this case, the variables (A n /un +1 )B(p, u n , A n )x 
converge in probability to TC p for some constant C p , and thus S n converges 
in probability to the sampling frequency ratio k, which is 2 here. When 
the noise is pure rounding at some level a n , then again it is the leading 
factor and \J A n an~ p B(p, u n , A u )t converges in probability to some positive 
limiting variable, as soon as a n /u n — > and a^A n — > oo. Thus S n converges 
in probability to Vk [when a n > u n we have B(p,u n , A h )t = and then 
S n is not even well defined; however, here the truncation level u n used in 
practice is quite bigger than the rounding level of 1 cent]. 

The values of a are similar to those employed in simulations, and indexed 
in terms of standard deviations of the continuous martingale part of the log- 
price: we first estimate the volatility of the continuous part of X using the 

a 1/2 

small increments, those of order A n , and then use that estimate to form 
the cutoff level used in the construction of the test statistic. To account for 
potential time series variation in the volatility process at, that procedure is 
implemented separately for each day and we compute the sum, for that day, 
of the absolute value of the increments that are smaller than the cutoff, to 
the appropriate power p. For the full year, we then add the truncated power 
variations computed for each day. 

The results in both Figures 10 and 11 tell a similar story. First, the empiri- 
cal estimates are always on the side away from the limit under H\ , indicating 
that the null hypothesis of a Brownian motion present will not be rejected. 
Second, as the sampling frequency decreases, the empirical values get closer 
to the theoretical limit under H$. For very high sampling frequencies, the 
results are consistent with some mixture of the noise driving the asymp- 
totics. They then slowly settle down toward the limit corresponding to a 
null hypothesis of a Brownian present as the sampling frequency decreases, 
and the noise presumably becomes less of a factor. 

Next, we turn to the results of the second test on the same data series in 
Figures 12 and 13. The test statistic S' n is implemented with 7 = 2, with data 
sampled every A n = 5 seconds. The empirical estimates are represented by 
a star, with the vertical dashes representing a 95% confidence interval. Also 
represented on the plots are the limits corresponding to Hq (no Brownian) 
and H\ (Brownian present). The theoretical limits correspond to those given 
in Figures 6 and 7, except that the theoretical limit under H± is plotted for 
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Fig. 12. Empirical estimates of S' n for various truncation levels a from all Intel trans- 
actions during 2006. 
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Fig. 13. Empirical estimates of S' n for various truncation levels a from all Microsoft 
transactions during 2006. 
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a value of /3 = 1.5 for the same reason as above. We find that the empirical 
estimates tend to be lower than the value specified by Hq, which leads to 
a rejection of the null hypothesis of no Brownian motion. The estimates 
are, however, generally higher than their expected value under Hi, consis- 
tent with the upward bias identified in simulations, the bias being more 
pronounced when (3 gets closer to 2. 

To summarize, the answer from both tests appears consistent with the 
presence of a continuous component in the data: using S n , we do not reject 
the null of a Brownian motion present, while using S' n we reject the null of 
its absence. 

6. Technical results. By a standard localization procedure, we can re- 
place the local boundedness hypotheses in our assumptions by a bounded- 
ness assumption, and also assume that the process X itself, and thus the 
jump process AAj, are bounded as well. That is, for all results which need 
Assumption 1 we may assume further that, for some constant C > 0, 

(37) \b t \,\a t \,L t ,\AX t \ <C, hence also F t {[-C, C] c ) = 0. 
When we need Assumption 2 we may assume the above, together with 

(38) \b t \,n u \a t \, j \x\F t (dx)<C. 

We call these reinforced Assumptions 1 or 2, and they are assumed in all 
the sequels instead of mere Assumptions 1 or 2, according to the case. 

Recall that if j3 < 1, we have (7) with b' t bounded as well. Otherwise the 
decomposition (7) is no longer valid, but under reinforced Assumption 1 we 
can always write 

(39) Y{ = X + [ b" s ds+ [ a s dW s , Y{' = X t -Yl, 

Jo Jo 

where b'{ = h + f xl^ x \ > ijF t (dx) defines a bounded process, and Y" is a 
purely discontinuous martingale. 

Also, K below denotes a constant which may change from line to line and 
may depend on C above. 

The key to all results is clearly the behavior of the processes B(p,u n , A n ) 
and U(u n , A n ). For establishing this behavior, it is convenient to introduce 
a few auxiliary processes, for u > an arbitrary cut-off level and Y an 
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arbitrary process 



(40) 



[t/A r 



B'(Y,p,A n ) t = ia?*t> 

i=l 

D ti>,u) t = \ AX s\ Pl {\AX s \<u}, 

s<t 

D \ u )t = Y~L{\AX s \>u}- 



6.1. Central limit theorems for the auxiliary processes. This subsection 
is devoted to recalling or proving some limit theorems for B'(X,p, A n ) and 
for the auxiliary processes introduced in (40). First, we recall from Theorem 
2.4 of [12] that under Assumption 1 (and even much more generally), 



(41) 



0<p<2 



Al-^B'(X,p,A n ) 



A(p) t 



m r 



'ds, 



p>2,X continuous 
k (u n ) satisfies (17) 



Al- p/2 B>(X,p,A n ) t ^A(p) t , 
B(2,u n ,A n ) t ^A{2) t 



[the last property is proved when u n = a A™ with a > and w £ (0, 1/2), 
but the proof works as well when (17) holds]. 

Lemma 1. Suppose that X is continuous, and let t > and p > 1 and 
k>2. Under Assumption 2 the two-dimensional variables 



(42) 



±=(A l -r/ 2 B>(X,p,A n ) t -A(p) 



u 



A)r /2 B\X,p, kA n ) t - fcP/ 2 -M(p) t ) 



stably converge in law to a limit which is defined on an extension of (£l,J-, 
(J~t)t>o, IP) and which, conditionally on T ', is a centered Gaussian variable 
with variance- covariance matrix given by 

(m 2p - m 2 )A{2p) T (m h>p - kP/ 2 m 2 p )A(2p) T ' 



(43) 



1 

m 2p 



( mfc _ fcp/2 m 2) A( 2p ) T kP- 1 (m 2p - ml)A{2p) 



T 



(The same would hold if p € (0, 1] , under the additional assumption that 
at is bounded away from 0.) 

Proof of Lemma 1. We can assume reinforced Assumption 2. The 
result will follow from Theorem 7.1 of [11]. Assumption (H) in that paper is 
slightly more restrictive than reinforced Assumption 2, but a close look at 
the proof yields that this theorem still holds in the present situation. 
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We apply the quoted Theorem 7.1 to the two-dimensional function on W k 
whose components are |xi| p + • • • + \xk\ p and \x\ + • • • + Xk\ p ■ This function 
is C 1 with derivatives having polynomial growth. With the notation of that 
paper, variable (42) with the nontruncated variations is equal to Z n + R n , 
where 



(44) 



1 U n V'{f,k,^n)t-\j\tU) ds 



and R n is a remainder term with second component equal to 0, and with 
first component 



(45) 



[t/A n ] 

A l/2-p/2 \AfX\P. 
i=k[t/kA n ]+l 



By (37) we have I?(|A™X| P ) < KA n , and hence, since there are at most k 

p 

summands in the definition of R n , we deduce that R n — > 0. On the other 
hand, the aforementioned result yields that Z n converges stably in law to 
a limiting variable, which is exactly as described in the statement of the 
lemma. □ 



Lemma 2. Let t>0, and suppose Assumption 1 and p > (3 and u n — > 0. 
Then 



(46) 



ut P D(p,u n ) t A -^— Q A U uiD\u n ) t A A t . 



p-0 



Moreover, if (3 f < (3/2 the four- dimensional variables 

fi-v 



( -^- 2 (-t P D(p,u n ) t -^A t \ \ 



(47) 



a,, 



A, 



1 



/3/2 
Un 



(u^D'(u n ) t - A t ) 



-^(( 7 u n ^D\j Un )t-At) 

Un 



stably converge in law to a limit which is defined on an extension of (Cl,J-, 
(J~t)t>o,P) and which, conditionally on T , is a centered Gaussian variable 



BROWNIAN MOTION OR NOT 



29 



with variance- covariance matrix AfC , where C is the 4x4 matrix 



( P 



(48) 



Ci 



2p-P 
2p~p 






2p-p 

Pi p 

/3(l-7 /3 ' p ) 

P~P 






P -f3 
1 
1 






1 



Proof. Assumption 1 here implies Assumption 6 of [2], with the same 
P and At, and with there substituted with any number in (P',P) here. 
Then all statements concerning D(p,u n )t are in Proposition 5 of that paper. 
However, we must redo the proof to obtain the joint convergence for the 
processes D(p,u n ) and D'(u n ). 

Let D(p,u) and D'(u) be the predictable compensators of D(p,u) and 
D'(u), and M{u) = u^ p {D(p,u) - D(p,u)) and M'{u) = u^{D'{u) - D'(u)). 
Observe that D'(u) t = jj F s ([-u,u] c ) ds and \F t ([-v,v] c ) -V 13 A t \ < KL t v~^' 
by Assumption 1. Therefore, exactly as in the paper (and (C.23) and (C.24) 
in it), we see that if q > j3, 



(49) 



P'<P 



ut q D(q,u. 



PA t 



n)t 



u^D'(u n ) t -> A t 



(3/2 
Un 



g D(q,u n ) t 



PA t 



-jj^\u^D' (u n ) t 



P 



A t \^0. 



The processes M(u) and M'{u) are martingales, and if u < v the brackets 
are given by the following formulas: 

(M{u),M(v)) = {uvf~ p D{2p,u), (M'(u),M'(v)) = (uvf£>'(v), 

(M(u),M'(v))=0, (M'(u),M(v))=u^v l3 - p {D(p,v)-D(p,u)). 

This, applied with (u,v) equal to (u n ,u n ) or {u n ,^u n ) or {^u n ^u n ), and 
combined with the first part of (49), yield that the bracket matrix at time 

/3 /2 

t of the 4-dimensional continuous martingale M n = u n (M (u n ) , M ( r yu n ) , 
M'(u n ), M'{pfu n )) converges to A t C in probability, where C is given by (48). 
Then as in Proposition 5 of [2] one deduces that M" converges stably in law 
to the limit described in the statement of the lemma. It remains to deduce 
from the second part of (49) that the difference between M" and the variable 
defined by (47) goes to in probability. □ 
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6.2. The behavior of B(p,u n ,A n )T- In this subsection we establish the 
behavior of B(p,u n , A n ) for the relevant values of p and for the cases not 
covered by (41). This is done in several lemmas. 

Lemma 3. Under Assumption 1, and if u n satisfies (17), we have 

(50) B(4,u n ,A n ) t AO. 

Proof. We first observe that B(4,v, A n )x converges in probability to 
G(v)t = ^2 s <t I A^"s| 4 1{|ax s |<u} f° r an y fixed v > such that P(3s < T : 
|AX S | = v ) = 0. Hence there is a sequence v m — > such that B(4,v m , A n )x 
converges in probability to G{v m )T- On the one hand B(A, u n , A n )x < 
-B(4, v m , A u )t as soon as u n < v m . On the other hand we have G(v m )T — > 
as m — > oo. Then the result follows. □ 

Lemma 4. Assume (17) and reinforced Assumption 1, and let p> 0. If 
either p<2, or p > 2 wrai/i p_ > 2 p_2/3 , we /lave 

(51) K~ p/2 B(p, u n ,A n ) t A A(p) t . 

Proof. We consider decomposition (39). In view of (41), it is enough 
to prove that under the conditions of the lemma we have 

(52) A l ~ p l 2 (B{p, u n , A n ) t - B'(Y',p, A n ) t ) A 0. 
The left-hand side above is A^ 1 X)|=i C™' where 

c = iA?r + A-y'ri { |A ? xi< u „ } - iA?yr- 

With k = 1 when p > 1 and k = otherwise, we have the following inequal- 
ities, for all m,q>0: 

1 1 

\a?y>\>^ =► \a<K\A?y\ p+q /<, 

(53) \A"X\>2u n , | A?y'| < ^ 

=> ici < \^i i y'\ p \KY"r/<\ 

|A"X|<2u n , \A?Y'\<^ 

\C\<K((\A?Y"\Au n r 

+ K\A?YT 1 (\A?Y"\Au n )), 

where we have used the inequality ||x + y| p — \ x\ p \ < K(\y\ p + \x\ p ~ 1 \y\) when 
p > 1 and \\x + y\ p — \x\ p \ < \y\ p when p < 1. In view of (37), we have the 



BROWNIAN MOTION OR NOT 



31 



estimates 

,nv»\2\ ISA „ \ n TCVI Anv/\q\ ^ ts A <?/ 2 



(54) 



A^/ 2 E(|CI)< 



E(|Afy"| 2 ) <KA n , q>0 E(\A*Y' |«) < K q A q n 

r G (0,2] =► E((|AfF"| Au n ) 2 ) < K T A n u n ^ 

(the first estimate is obvious and the second one follows from Burkholder- 
Davis-Gundy inequality; the third one follows from (6.25) of [11] applied to 
the process Y" and with a n = u n /yf A n , which goes to oo by (17), and with 
r as above). Then, using Holder's inequality and (|x| f\u n ) p < u^~ 2 (\x\ Au n ) 2 
when p > 2, we deduce from (53) applied with q = m = 1 and from u n < K 
that 

KA n + u^/V + KU \-r/A , if p < 2) 

KA n + Al-^ui-r + u W 2 ) , if p > 2. 

We have A\[ 2 ju n ^ by (17), hence £(A*~ p/2 E?i? ^ °' as soon as 
p < 2, or p > 2 and p_ > 2 (p"r) • Since r is arbitrary in (/3, 2], we deduce the 
result. □ 

Lemma 5. Let p G (0, 2], and assume reinforced Assumption 1 with (3 < 1 
and (17) with further p_ > 2 pZ^ w/ien p > 1. Then, with X' given by (7), 
we have 

(55) Ay 2 ^/ 2 (B(p,n n , A n ) t - B'(l' )ft A„) t )A0. 

Proof. The proof of this lemma is similar to that of the previous one. 
The left-hand side of (55) is A n l2 ~ p/2 ^!=f " ] Co where 

C = I a?x' + Afx"\n i]A n mun} - \AfXT. 

Then (53) holds with (X',X") instead of (Y',Y"), whereas (54) is replaced 
by 

E(\AfX"\) < KA n , q>0 E(| A? X'\«) < K q A q J 2 , 

re(/9,l) => E(|AfX"| An n )<i^ r A n <- r 

(we now use (6.26) of [11] applied with a n = u n /yj A n and r as above). 
Hence, using (53) for the pair (X',X"), plus the fact that (|sc| A u n ) p < 
Un m (\x\ A u n ) m for < m < p and Holder's inequality, we deduce that for 
all q > and m G (0, 1) and r G (/3, 1), and with ac as in the previous proof, 

Ay 2 -^ 2 E(ici) 

Al /2 ~ l/2 AT~ 1/2 



(56) 



<K r A n (A^+A^+A^ +K A^), 
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where V\ = — p + ) — \ and v 2 = m(l — p+r) — \ and V3 = + (p — r)p_ 
and V4 = m — 1 + p~(l — mr). Since p + < 1/2 we have v\ > for q large 
enough. When r J, /3 and m|l, we have i>2 — > v' 2 = (1 — P+/3) — \ and v' 3 —> 
^3 = -\. (jp — j3)p_ and — > v' A = (1 — P)p~, and (55) will follow from 
v'j > for j = 2, 3 and also for j = 4 when p > 1. We have i> 2 > because 
/3p + < \. When p < 1 we have v' 3 > 0. When p = 1 then U3 > if p_ > 0, 
and when p > 1 we have i> 3 > and v' A > as soon as p_ > 2 pZ^ • ^° (^^) ^ s 
proved. □ 



The previous lemma essentially gives the behavior of B(p,u n ,A n ) when 
the leading term is due to the continuous martingale part of X. When this 
part vanishes, we have another type of behavior, which we describe now. 



Lemma 6. Letp> 1, and assume reinforced Assumption 1. 



(i) Ifp>(3 and (17) holds with p + < ^— - we have 



(57) 



fi-v 



(B(p, u n , A n ) t - D(p, u n ) t 



on the set £1 



no W 



no W 



(ii) If p > 2 and (17) holds with p + < %jf, and if P' < /3/2, we have 
(58) u^ 2 - p (B(p,u n ,A n ) t -D(p,u n ) t )^0 on the set ^ oW ■ 

Proof. Since the variables B(p,u n , A n ) t are the same on the set Vt 
when they are computed on the basis of X or on the basis of the process 
Xt — L a s dW s , it is no restriction to assume that a s = identically. 

The proof is based on the result of [2] , when at = identically. We have 
Assumption 7 of that paper with H = f3 and a = 1 — /3'//3 and thus 4>'(x) = 
. We can then apply Lemmas 8 of that paper with the version of r)(p) n 
given at the end of Lemma 7 (because X c = here), to obtain that for 
p > 1 V fi and if p + < and for any r G (0, 



2 , 
3)' 



(59) 
where 



E(\B(p,u n ,A n ) t 



3p+/3 

D(p, Un ) t \)<K r tu p n -P V (p) n , 



i=i 



t 1 

XI = 

P+ 

x 3 =r(l 



/3(l + r), 
P. 



x 2 = /?(2 + 3r), 

P+ 

= 1 1, 2 5 =P-P- 

PP+ P 
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Clearly, (57) follows from (59), as soon as we can choose r € (0, 3 ^ +/3 — |) 
such that Xj > for all j = 1, . . . , 5: this is obvious when (3 <p and p+ < 2— 

As for (58), it will also follow from (59) if we can choose r as above, such 
that Xj > (3/2 for all j = 1, . . . , 5. This property holds for x§ because (3' < (3/2 
is assumed, and for £4 because p + < 1/2. For j = 1, 2, 3, and since x\ and X2 
do not depend on p and X3 increases with p, it is enough to consider the case 
p = 2. Then if we let r decrease strictly to 2~]j > we see that X3 > (3/2, whereas 

x\ and X2 increase to j- — an d to — ^^-p respectively, and these 

quantities are strictly bigger than (3/2 if p + is strictly smaller than 



and ft( 10 _^3) • Now, recall that one should also have < r < — |, which 
is possible if and only if p + < wi^rj ■ All these conditions on p + are ensured 
if P+ <^. □ 



6.3. The behavior of U(u n ,A n ). The behavior of U(u n ,A n ) has been 
exhibited in [4], including a central limit theorem. However, here we need 
a joint CLT, at least on the set Q^ oW n St^ j f° r the pair (U(u n , A n )y, 
Z?(2,it n , A n )), and even for this pair jointly with the similar pair with the 
truncation levels ^yu n . For this we will use Lemma 2, and we thus need to 
show that the difference U(u n ,A n ) — D'(u n ) is negligible, after a suitable 
normalization. To this effect, we use the contorted way of using the afore- 
mentioned CLT for U(u n , A u )t, but knowing this result it seems the shortest 
route toward the desired joint CLT. 



Lemma 7. Assume reinforced Assumption 1. 

(i) Under (17) we have 

(60) u^U(u n ,A n ) t 

(ii) If moreover (3" < and < f and (17) holds with p + < A A, 
then 

(61) u^ 2 (U(u n ,A n ) t -D'(u n ) t ) AO. 



Proof. In [4] the truncation level was set as u n = aA°. However, it 
is obvious that it works with any truncation level u n subject to (17), with 
the conditions on w replaced by exactly the same conditions on p + . With 
this in view, (i) follows from Proposition 1 of that paper. The proof of (ii) 
is much more involved, and broken into several steps. 

Step (1) We write U(u n , A n ) t - D'(u n ) t as H(l)% + H{2) 1 / - H(3)?, where 
£T(3) t » = D'(u n ) t - D'(u n ) An[t/An] and H(j)? = ^Ef" 1 C(j)™ for j = 1,2, 
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with 

C(l)" = l{A?D'(un)=0,\A?X\>Un}> 
C( 2 )f = l{A?D'(u n )>l,|AJ'X|>Un} - A™D'{u n ). 
In this step we prove 

(62) </ 2 iT(3)?A0. 

The left-hand side above is nonnegative, with expectation E(D' (u n )t — 

D (itnjAnft/An])' w hich is smaller than KA n /u n (see the proof of Lemma 2). 
Since p+(J3/2 + (3') < 3p + p/2 < 1 we deduce (62). 
Step (2) Let us assume for a moment that we have 

(63) rt2)?A0. 

In Proposition 2 of [4], and upon replacing aA" by u n , it is proved that 

under our assumptions on /?', /3" and p + , the sequence Z n = u n ^^ 2 {unU (u n ,A n ) t — 
At) converges in law to a limiting variable Wt which is centered. On the other 

hand, Lemma 2 yields that Z' n = u n ^ ^ 2 (unD' (u n )t — A t ) converges in law to 
a limiting variable W t which is also centered (and, indeed, has the same law 
as W t ). 

Up to taking a subsequence, assume that the pair (Zn^Z^) converges in 
law to a pair (Z, Z 1 ) of variables which are centered, whereas Z n — Z' n = 
u?, /2 (H(l)f + H{2) T t l ). In view of (63) it follows that ui /2 H{l) r t l converges 
in law to Z — Z' . Therefore, since by construction H (1)" > we must have 
Z — Z' > 0. Since Z — Z' is centered, we must have Z' = Z a.s. In other 
words, for any subsequence of (Z n ,Z' n ) which converges in law, the limit is 
a.s. 0, and by a subsequence principle it follows that the original sequence 
Z n — Z' n goes to in law, hence in probability; this obviously implies (61). 

At this stage, we are left to prove (63) which will be implied by the 
following: 

(64) E(u£/ 2 |C(2)r|)<A n <; n 

for a sequence v n —> 0. 

We recall the property (B.12) of [2]: denoting by i?", . . . , R^, . . . the suc- 
cessive jump times of D'{u n ) occurring after (i — l)A n (with any fixed i), 
we have P(R™ < iA n ) < iP ' A j n Un^ '. This implies 

F(A?D'(u n ) > 2) < KA 2 n u~^, E(A?D'(u n )l {A?D , {Un) > 2} ) < KA^u~ 213 . 

Since p + < 2/(3/3) we have A n jun^ 2 — > 0. Therefore, for proving (64) it 
remains to show that 

(65) U V 2 F(A?D'(u n ) = 1, \A?X\ < u n ) < A n v n . 
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Set 

X"(u n ) t = ]T AX s l {{AXsl>Un} , X'(u n ) = X- X"{u n ). 

s<t 

We have estimate (B.15) of [2] again, with H = (3 and 4>'(x) = x~@' . Thus, 
since on the set {AfD'(u n ) = 1} the process X"(u n ) is piecewise constant 
and with a single jump on the interval {(i — l)A n ,iA n ]}, and the size of this 
jump is bigger than u n , we deduce 

(66) P(A^'K) = 1, \AfX"(u n )\ < u n (l + w n )) < KA n (u^w n + uj') 

for any choice of the sequence w n decreasing to 0. 

Finally we use estimate (61) of [4] to obtain for all q > 2 

A 2 A q/2 

(67) F(A?D'{u n ) = 1, \A?X'{u n )\ > u n w n ) < K—^ + K q -^- T . 

W n U n W n U n 

Of course the left-hand side of (65) is smaller than u n times the sum of 
the left-hand sides of (66) and (67). Therefore, it remains to prove that we 
can choose the sequence w n and q > 2 in such a way that y n (j) — > for 
j = 1, 2, 3, 4, where 

Un 

A A q/2 ~ l 



Vn( 3 ) = „ . 2/n(4) 



3/3/2 2 ' q -/3/2 g" 

We have y n (l) — > by hypothesis. Upon taking u; n = for some r, this 
amounts to showing that one can find r > and q > 2 such that r > ^ and 
— — 2r > ^ and q — 2 > (q(2r + 2) — f3)p+. The last condition is satisfied 
for g large enough as soon as 2(r + l)p+ < 1. Then it is easy to see that the 
choice of r is possible if and only if p + < A A • □ 

6.4. Central limit theorems for B(p,u n , A n ) and U(u n , A n ) . The previ- 
ous results allow us to derive joint CLTs for the processes B(p,u n , A n ) and 
U(u n ,A n ), as required for Theorems 2 and 5. For the first of these two the- 
orems, we use the following proposition which follows from Lemmas 1 and 
5: 



Proposition 1. Let p e (1, 2] and t>0 and k>2. Under Assumption 
p-i 

2(p-/3) 



2 and (17) with p_ > 2 ? v -8) the two-dimensional variables 



^(A n ^ 2 B(p,u n ,A n ) t -A(p) t ,A n ^ 2 B(p,u n ,kA n ) t - k^- l A{p) t 
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stably converge in law to a limit which is defined on an extension of (O, J 7 , 
(J-t)t>o, P) an( ^ which, conditionally on T , is a centered Gaussian variable 
with variance- covariance matrix given by (43). 

For the second theorem, we use the following consequence of Lemmas 2, 
6 and 7: 



Proposition 2. Let t>0 and -y > 1, and suppose Assumption 1. 

(i) If u n — > we have 

u^U(u n ,A n ) t -^+A t . 

(ii) Ifp>(3 and (17) holds with p + < 2r ~— , we have 

(69) u^ p B(p, Un, A n ) t —^A t in restriction to the set £l? oW . 

P-P 

(hi) If further f3" < and ft' and if (17) holds with p + < A 



5/3 A -0-, the four- dimensional varia 



bles 



( 

1 



0/2 
Un 



uP- p B(p,u n ,A 



nit 



" At 



p-ft 



\ 



fill 



(~fu n f- p B(p,~fu n ,An) 



P At 



P-P 



V 



P/2 



-j^(u^U(u n ,A n ) t -A t ) 
in 

((ju n fU^u n ,A n ) t -A t ) 



u 



J 



stably converge in law, in restriction to the set Qf oW , to a limit which is 
defined on an extension of (fi, F, (Ft)t>o,P) and which, conditionally on T , 
is a centered Gaussian variable with variance-covariance matrix A±C , with 
C given by (48). 



6.5. Proof of the theorems. It remains to prove the main theorems, for 
which we can assume the reinforced assumptions if necessary, without re- 
striction. 

First, the consistency results (20) and (30) are obvious consequences of 
(51), (69) and (68), plus the facts that A T > on ti^ and A(p) T > on 

Second, in order to prove Theorem 2 we use Proposition 1 which, upon 
using the "delta method," shows that under the stated assumptions the 
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variables ^=(«S n - A^/ 2 " 1 ) converge stably in law, in restriction to flip , to 
a variable which conditionally on T is centered Gaussian with variance 

i p 

With y n given by (24), we have -%-V n — > V by (51), and the result readily 
follows. 

In the same way, Proposition 2 yields that -7J72 (S' n — 7 2 ) converges stably 



u 



in law, in restriction to ft T oW n fiy , to a variable which conditionally on T 
is centered Gaussian with variance 



7* ( P{2-Pf 
A T \ 4-/3 

If V£ is given by (32), then -^rK ^' in restriction to fij ^ n by (69) 

and (68). This finishes the proof of Theorem 5. 

Finally, for both Theorems 3 and 6, the claims concerning the asymptotic 
level of the tests are trivial consequences of two central limit Theorems 2 
and 5. It remains to prove that the asymptotic power is 1 in both cases. By 
virtue of (20) and (30), this will follow from the next two properties, under 
the appropriate assumptions 



(70) 




on the set n T oW nfij?, 
on the set fljf n fltf . 



The first of these properties follows from (69), and the second one follows 
from (41), (50) and (68). 
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